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Semi-algebraic colorings of complete graphs 


Jacob Fox * Janos Pach^ Andrew Suk^ 


Abstract 

In this paper, we consider edge colorings of the complete graph, where the vertices are points 
in and each color class Ei is defined by a semi-algebraic relation of constant complexity on 
the point set. One of our main results is a multicolor regularity lemma: For any 0 < e < 1, the 
vertex set of any such edge colored complete graph with m colors can be equitably partitioned 
into at most parts, such that all but at most an e-fraction of the pairs of parts are 

monochromatic between them. Here c > 0 is a constant that depends on the dimension d 
and the complexity of the semi-algebraic relations. This generalizes a theorem of Alon, Pach, 
Pinchasi, Radoicic and Sharir, and Fox, Pach, and Suk. As an application, we prove the following 
result on generalized Ramsey numbers for such semi-algebraic edge colorings. 

For fixed integers p and q with 2 < q < (p, a (p, q)-coloring is an edge-coloring of a complete 
graph in which every p vertices induce at least q distinct colors. The function f*{n,p,q) is the 
minimum integer m such that there is a (p, (?)-coloring of with at most m colors, where 
the vertices of are points in K'^, and each color class can be defined as a semi-algebraic 
relation of constant complexity. Here we show that /*(n,p, [logp] -|- 1) > H p ^ , and 

f*{n,p, [logp]) < O(logn), thus determining the exact value of q at which f*{n,p,q) changes 
from logarithmic to polynomial in n. 

We also show the following result on a distinct distances problem of Erdos. Let V be an 
n-element planar point set such that any p members of V determine at least ( 2 ) — p + 6 distinct 
distances. Then V determines at least distinct distances. 


1 Introduction 

Regularity lemma for multiple semi-algebraic relations. Szemeredi’s regularity lemma |35j 
is one of the most powerful tools in modern combinatorics. It was introduced by Szemeredi in his 
proof [ 33 ] of the Erdos-Turan conjecture on long arithmetic progressions in dense subsets of the 
integers. The lemma (see [23]) has since become a central tool in extremal combinatorics with many 
applications in number theory, graph theory, discrete geometry, and theoretical computer science. 

In its simplest version [35], the regularity lemma gives a rough structural characterization of 
all graphs. A partition is called equitable if any two parts differ in size by at most one. According 
to the lemma, for every e > 0 there is K = K(e) such that every graph has an equitable partition 
of its vertex set into at most K parts such that all but at most an e fraction of the pairs of parts 

‘Stanford University, Stanford, CA. Supported by a Packard Fellowship, by NSF CAREER award DMS 1352121, 
and by an Alfred P. Sloan Fellowship. Email: jacobfox@stcinford.edu. 

^EPFL, Lausanne and Courant Institute, New York, NY. Supported by Hungarian Science Foundation EuroGIGA 
Grant OTKA NN 102029, by Swiss National Science Foundation Grants 200020- 144531 and 200021-137574. Email: 
pachScims.nyu.edu. 

^University of Illinois at Chicago, Chicago, IL. Supported by NSF grant DMS-1500153. Email: suk@uic.edu. 


I 




are e-regular0 The number of parts K grows extremely fast as function of 1/e. It follows from 
the proof that K(e) may be taken to be of an exponential tower of twos of height Gowers 

m used a probabilistic construction to show that such an enormous bound is indeed necessary. 
Consult 0, m, |15j for other proofs that improve on various aspects of the result. 

Alon et al. [T] (see also Fox et al. M) established a strengthening of the regularity lemma for 
point sets in equipped with a semi-algebraic relation E. To be more precise, let V be an ordered 
point set in and let E C (^). We say that E is a semi-algebraic relation on V with complexity 

at most t if there are at most t polynomials gi,... ,gs G M[xi,... ,X 2 d]) s < t, of degree at most t 
and a Boolean formula such that for vertices u,v £V such that u comes before v in the ordering, 

{u,v) e E 4^ ^{gi{u,v) >0-,...-,gsiu,v) >0) = 1. 

At the evaluation of g^{u,v), we substitute the variables xi,... with the coordinates of u, the 
variables x^+i,... ,X 2 d with the coordinates of v. We may assume that the semi-algebraic relation 
E is symmetric, i.e., for all points u,v € (u, v) € E if and only if (v, u) G E. Indeed, given such 

an ordered point set 1/ C and a not necessary symmetric semi-algebraic relation E of complexity 
at most t, we can define V* C with points {v, i) where v £V and v is the ith smallest element 
in the given ordering of V. Then we can define a symmetric semi-algebraic relation E* on the 
pairs of V* with complexity at most 2t 2, by comparing the value of the last coordinates of the 
two points, and checking the relation E using the first d coordinates of the two points. We will 
therefore assume throughout this paper that all semi-algebraic relations we consider are symmetric, 
and the vertices are not ordered. We also assume that the dimension d and complexity t are fixed 
parameters, and n = \ V\ tends to infinity. 

It was shown in [T] that any point set 1/ C equipped with a semi-algebraic relation E C (^) 
has an equitable partition into a bounded number of parts such that all but at most an e-fraction 
of the pairs of parts {V\, V 2 ) behave not only regularly, but homogeneously in the sense that either 
Vi X V 2 C Fi or Vi X V 2 n i? = 0. Their proof is essentially qualitative: it gives a poor estimate for 
the number of parts in such a partition. Fox, Pach, and Suk m gave a much stronger quantitative 
form of this result, showing that the number of parts can be taken to be polynomial in 1/e. 

In this paper, we consider points sets equipped with multiple binary relations. Let V be an n- 
element point set in equipped with semi-algebraic relations Ei,, Em such that EiL)- ■ -UEm = 
( 2 ). Then for any e > 0, an m-fold repeated application of the result of Fox, Pach, and Suk m 
gives an equitable partition of V into at most K < (1/e)'^™' parts such that all but an e-fraction of 
the pairs of parts are complete with respect to some relation E^. In Section [21 we strengthen this 
result by showing that the number of parts can be taken to be polynomial in m/e. 

Theorem 1.1. For any positive integers d,t > 1 there exists a constant c = c{d,t) > 0 with the 
following property. Let 0 < e < 1/2 and let V be an n-element point set in equipped with 
semi-algebraic relations Ei,...,Em such that Ei has complexity at most t and (^) = EiU - ■ ■ UEm- 
Then V has an equitable partition V = Vi U • • • U Vk into at most K < {m/eY parts such that all 
but an e-fraction of the pairs of parts are complete with respect to some relation Ei. 

Ramsey and generalized Ramsey numbers for semi-algebraic relations. In Section [3l we 
give an application of Theorem 11.11 to semi-algebraic multicolor Ramsey numbers. The Ramsey 

^For a pair (Vi, Vj) of vertex subsets, e(Vi, Vj) denotes the number of edges in the graph running between Vi and 
Vj. The density d(Vi,Vj) is defined as ■ Tii® (RiFi) i® called e-regular if for all P/ C Vi and V/ C Vj 

with |P/| > e|Fi| and |P/| > e\Vj\, we have \d(y( , V') — diVi, Vj)\ < e. 
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number R{p; m) is the smallest integer n such that any m-coloring on the edges of the complete 
n-vertex graph contains a monochromatic copy of Kp. We define analogously, where 

we restrict at our attention to graphs with vertices as points in and the colorings are given by 
semi-algebraic binary relations of complexity at most t. Here each edge receives at least one color. 
Clearly, < R{p]m). 

For the case p = 3, in establishing a famous theorem in additive number theory, Schur |29] 
proved that 


n{2'^) < R{3]m) <0{ml). (1) 

While the upper bound has remained unchanged over the last 100 years, the lower bound was 
successively improved and the current record is R{3]m) > H(3.199”*) due to Xiaodong et al. [39] . 

In [32], it was shown that for fixed d,t > 1, i?rf^t(3;m) = improving the upper 

bound in ([T|) in the semi-algebraic setting. Our next theorem extends this result to for 

any p > 3. 

Theorem 1.2. For fixed integers d, t > 1 and p>3, we have 

RdAP'^rn) = 

Here and throughout the rest of the paper, all logarithms are in base 2. 

The following natural generalization of the Ramsey function was first introduced by Erdos and 
Shelah (see 0), and studied by Erdos and Gyarfas in m- 

Definition 1.3. Eor integers p and q with 2 < q < ( 2 ); a {p,q)-coloring is an edge-coloring of a 
complete graph in which every p vertices induce at least q distinct colors. 

Let f{n,p, q) be the minimum integer m such that there is a {p, g')-coloring of Kn with at most m 
colors. When p and q are considered fixed integers and n tends to infinity, estimating f{n,p,q) 
can be a very challenging problem. Estimating f{n,p,2) is equivalent to estimating R{p]m) since 
f{n,p,2) is the inverse of R{p‘,m). In particular, 

Erdos and Gyarfas m showed that 

H < f{n,p,p) < O . 

Surprisingly, estimating f{n,p,p — 1) is much more difficult. In [IDj . Erdos and Gyarfas asked for p 
fixed if f{n,p,p-l) = The trivial lower bound is f{n,p,p-l) > f{n,p,2) > H 

Eor p = 4, this was improved using the probabilistic technique known as dependent random choice 
by Kostochka and Mubayi [H] and further by Eox and Sudakov [18] to /(n,4,3) > H(logre). This 
was later extended in Gonlon et al. [6] to f{n,p,p — 1) > H(logn). The Erdds-Gyarfas question 
was answered affirmatively by Mubayi [28] for p = 4 who found an elegant construction which 
implies /(n,4,3) < eO(Viog»^)^ Gonlon et al. [6] later answered the Erdds-Gyarfas question for all 
p, showing for p > 4 fixed that /(n,p,p — 1) < ^ p 2 +01 ^ While it is now known that 

/(n,p,p— 1) does not grow as a power in n, there remains a very large gap between the upper and 
lower bounds on /(n,p,p — 1). 
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Here we study a variant of the function f{n,p,q) for point sets H C equipped with semi- 
algebraic relations. More precisely, we define the function fd,tin,p,q) to be the minimum m such 
that there is a {p, g)-coloring of Kn with m colors, whose vertices can be chosen as points in 
and each color class can defined by a semi-algebraic relation on the point set with complexity at 
most t. Unlike in the definition of RddiP',i^), here we require that each edge receives exactly one 
color. 

Clearly we have f{n,p,q) < /d,t(u,p,g). The following result shows the exact moment when 
q) changes between a power of n to logarithmic in n. 

Theorem 1.4. For fixed integers d, t > 1, there is a c = c{d,t) > 0 such that for p > 3, we have 

fd,t{n,p, \logp] -M) > 0 . 

Moreover, for t > 4, 


fd,t{n,p, [logp]) < O(logn). 

The proof of Theorem 11.41 is given in Section [3l 

Distinct distances. In his classic 1946 paper [7|, Erdos asked what is the minimum number 
of distinct distances determined by an n-element planar point set V. He showed that a ifn x y/n 
integer lattice determines ©(n/Vlogn) distinct distances, and conjectured that any n-element point 
set determines at least distinct distances. Several authors established lower bounds for this 

problem, and Guth and Katz [20] answered Erdos’ question by proving that any n-element planar 
point set determines at least n(n/logn) distinct distances. 

In [10] . Erdos and Gyarfas studied the following generalization. For integers p and q with 
p < q < ( 2 ), let D{n,p,q) denote the minimum number of distinct distances determined by a 
planar n-element point set V with the property that any p points from V determines at at least q 
distinct distances. Notice by definition 

D{n,p,q) > fd,t{n,p,q) > f{n,p,q), 

and by m we always have D{n,p,q) > ^(n/logn). A simple argument shows that D(n, 4, 5) > 
Q{n), and clearly D(n,4, 5) < O(n^). It is an old question of Erdos (see [9] and [3]) to determine if 
D(n,4,5) is superlinear or o(n^). Likewise, the best known lower and upper bounds for D(n, 5,9) 
are of the form 11 (n) and O(n^). 

Observe that, for every hxed integer p > 2, we have D (n,p, (^) — \j)/2\ -|- 2) > H(n^), since 
any distance can occur at most [p/2\ — 1 times in this setting. In [10] . Erdos and Gyarfas showed 
that for p > 7 


D (^n,p, Q - [p/2j +1^ >f (^n,p, Q - [p/2\ + > n ■ 


Sarkozy and Selkow m showed that 


D (^n,p, Q -p + llogp] + ^ > f (^n,p, Q + l^ogp] + 4^ > n (n^+") , 
where e = e{p). Our next result establishes a better bound for this distinct distance problem. 
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Theorem 1.5. For fixed p > 6, we have 

The proof of Theorem 11.51 appears in Section [5l 

Ramsey-Turan numbers for semi-algebraic relations. The classical theorem of Turan gives 
the maximum number of edges in a Kp-free graph on n vertices. 

Theorem 1.6 (Turan, [M])- Let G = {V,E) be a Kp-free graph with n vertices. Then 

The only graph for which this bound is tight is the complete {p — l)-partite graph whose parts are 
of size as equal as possible. This graph can easily be realized as an intersection graph of segments 
in the plane, which is a semi-algebraic graph with complexity at most four. Therefore, restricting 
the problem to semi-algebraic graphs does not change Turan’s theorem. 

Let iL be a fixed graph. The Ramsey-Turan number RT(n,iL, a) is the maximum number of 
edges an n-vertex graph G can have without containing H as a subgraph and without having an 
independent set of size a. Ramsey-Turan numbers were introduced by Sos and were motivated 
by the classical theorems of Ramsey and Turan and their connections to geometry, analysis, and 
number theory. One of the earliest results in Ramsey-Turan Theory a ppeared in m, which says 
that for p >2 

BT{n,K2p-i,o{n)) = ^ 

For the even cases, a celebrated result of Szemeredi from 1972 [Ml states that 

RT(n, K4, o(n)) = + o(n^), 

O 

and a more general result of Erdos, Hajnal, Sos and Szemeredi HH states that 

RT(n, K2p, o(n)) = ^ + o(n^)- 

For more results in Ramsey-Turan theory, see a nice survey by Simonovits and Sos m- 

Here we give tight bounds on Ramsey-Turan numbers for semi-algebraic graphs, showing that 
the extremal densities for even cliques are different in the semi-algebraic setting than in the classical 
setting, and the densities for K 2 p are the same as for K 2 p-i. Let RT(i^i(n, Kp, o(n)) be the maximum 
number of edges a graph G = (T, E) can have, where T is a set of n points in and E C (^) is a 
semi-algebraic relation with complexity at most t, without containing Kp as a subgraph and also 
without having an independent set of size o(n). 

Theorem 1.7. For fixed integers d > 4, t > 10 and p>2, we have 

RTd,t(n, K2p-i,o{n)) = BTa^tin, K2p, o(n)) = ^ + o{n^). 
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In contrast to Turan’s problem, the answer to the Ramsey-Turan question for Rr 2 p-free graphs 
changes if we require the graph to be semi-algebraic with bounded complexity. We note that we do 
not need the full strength of Theorem 1 1.1 1 to establish Theorem 1 1.71 and one could use the regularity 
lemma established by Alon et al. [T] . The proof of Theorem 11.71 is given in Section [ 6 l 

We sometimes omit floor and ceiling signs whenever they are not crucial for the sake of clarity 
of presentation. 

2 Multicolor semi-algebraic regularity lemma 

In this section, we prove Theorem 11.11 A set A C is semi-algebraic if there are polynomials 
gi,... ,gt and a boolean formula such that 

A = {x E : ^igi{x) > 0;... ; gt{x) > 0) = 1}. 

We say that a semi-algebraic set in d-space has description complexity at most t if the number of 
inequalities is at most t, and each polynomial gi has degree at most t. Let cr C be a surface 
in that is, a is the zero set of some polynomial h E M[xi,..., x^]. The degree of a surface 
cr = {x E : h{x) = 0} is the degree of the polynomial h. We say that the surface ct C crosses 
a semi-algebraic set A if an A 7 ^ 0 and A (fi a. We now recall an old theorem known as the cutting 
lemma. 

Lemma 2.1 (Chazelle et al. [3], Koltun [22]). For d > 1, let Yi be a multiset of surfaces in 
each of which has degree at most t, and let r be an integer parameter such that 1 < r < N. Then 
there is a subdivision ofW^ into at most c\r‘^^ semi-algebraic sets Aj such that ci = ci{d,t), each 
Aj has complexity at most ci, and at most \Y\/r surfaces from Y cross Aj. In the case that d > 4, 
the bound on the number of semi-algebraic sets Aj in the subdivision of can he improved to 

We note that we will use the weaker bound of in the lemma above for this section, and use 
the stronger bound in Section [1.51 

We will first prove the following variant of Theorem ll.il from which Theorem 11.11 quickly follows. 

Theorem 2.2. For any e > 0, every n-element point set V C equipped with semi-algebraic 
binary relations Lii,..., Em C (^) with = Lii U • • • U Em and each Ei has complexity at most t, 
can be partitioned into K < parts V = lA U • • • U Vk, where C 2 = C 2 {d,t), such that 



where the sum is taken over all pairs (i,j} such that {Vi,Vj) is not complete with respect to E^ for 
all i = 1,... ,m. 

Proof. For each relation Et, let gi^i,... ,gi^t £ M[xi,... ,X 2 d] be polynomials of degree t, and let <I>j 
be a boolean formula such that 


{u,v)eEi > 0;... ;5j,t(u,x) > 0) = 1. 

For each point x E M'^, i E {1,... , m}, and j E {1,..., t}, we define the surface 

= {y E M"' : gi,j{x,y) = 0 }. 
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Let S be the family of tmn surfaces in defined by 


S = {(Tij(tt) ■. u & V,1 < i < m,l < j < t}. 

We apply Lemma 12.11 to S with parameter r = tm/e, and subdivide into s < ci 
semi-algebraic sets A^, with each having complexity at most ci, where ci is defined in Lemma 
EH Moreover, at most tmn/r = en surfaces from S cross A^ for every This implies that at most 
en points in V give rise to at least one surface in S that cross A^. 

Let [/£ = V n A£ for each i < s. We now partition A^ as follows. For i E {1,... ,m} and 
j E {1,..., s}, define A^^jj C by 

= {ic E A^ : crj^i(a:) U • • • U crosses Aj}- 

Observation 2.3. For any i, j, and i, the semi-algebraic set A^^jj has complexity at most C 3 = 

C3{d,t). 

Proof. Set o'j(x) = ai^i{x) U • • • U which is a semi-algebraic set with complexity at most 

C 4 = C 4 (d, t). Then 

. 3yi E s.t. yi E crj(x) n A,-, and 

We can apply quantifier elimination (see Theorem 2.74 in [2]) to make A^^jj quantifier-free, 
with description complexity at most C 3 = C 3 {d,t). 

□ 

Set iFi = : I < i < m,l < j < s}. We partition the points in Ui into equivalence classes, 

where two points u,v ^ Ui are equivalent if and only if u belongs to the same members of Ti as v 
does. Since Ti gives rise to at most C 3 |J^£| polynomials of degree at most C 3 , by the Milnor-Thom 
theorem (see [26] Chapter 6 ), the number of distinct sign patterns of these C 3 |J^£| polynomials is 
at most (50 c3(c 3|J^£|))'^ . Hence, there is a constant C 5 = C 5 (d, t) such that Fi is partitioned into at 
most C 3 {msY equivalence classes. After repeating this procedure to each t/^, we obtain a partition 
of our point set H = Hi U • • • U Vk with 

K < sc^imsY = C5m'^s'^+^ < ^ ^ 

where we define C 2 = cf^^. 

For fixed i, consider the part H- Then there is a semi-algebraic set A^. obtained from Lemma 
12 .II such that = V H A^. and H C C A^.. Now consider all other parts Vj such that not all of 
their elements are related to every element of H with respect to any relation E^. where 1 < A: < m. 
Then each point u & Vj gives rise to a surface in S that crosses A^.. By Lemma 12.11 the total 
number of such points in V is at most en. Therefore, we have 

= < men, 

j j 

where the sum is over all j such that VixVj is not contained in the relation Ei for any i. Summing 
over all i, we have 
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where the sum is taken over all pairs i,j such that iVi,Vj) is not complete with respect to Ei for 


Proof of Theorem \1.1[ Apply Theorem 12.21 with approximation parameter e/2. So there is a 
partition Q : V = UiU- ■ -UUk' into K' < (mje)^ parts with c = c{d,t) and X] 1^*11I — 
where the sum is taken over all pairs {i,j) such that (Ui, Uj) is not complete with respect to for 
all 1. 

Let K = 8e“^iLh Partition each part Ui into parts of size |y|/Ar and possibly one additional 
part of size less than |P|/Ar. Collect these additional parts and divide them into parts of size 
to obtain an equitable partition P : 1/ = Vi U • • • U Vk into K parts. The number of vertices of 
V which are in parts Vi that are not contained in a part of Q is at most K'\V\/K. Hence, the 
fraction of pairs x with not all Vi, Vj are subsets of parts of Q is at most 2K' jK = e/4. As 
e/2 + e/4 < e, we obtain that less than an e-fraction of the pairs of parts of V are not complete 
with respect to any relation Ei,..., E^- D 


3 Generalized Ramsey numbers for semi-algebraic colorings 

The goal of this section is to prove Theorem 11.41 Let P be a set of points in equipped with 
semi-algebraic relations Ei,..., Em such that each Ei has complexity at most t, (^) = EiU- ■ - UEm, 
and Ei CiEj = id for all i 7 ^ j. Let Si, 52 C P be /c-element subsets of P. We say that Si and S 2 are 
isomorphic, denoted by Si ~ S 2 , if there is a bijective function /i : Si —S 2 such that for u, u G Si 
we have {u,v) G Ei if and only if {h{u),h{v)) G Ei. 

Let S C P be such that |S| = 2 * for some positive integer s. We say that S is s-layered if s = 1 
or if there is a partition S = Si U S 2 such that |Si| = IS 2 I = 2 ®“^, Si and S 2 are (s — l)-layered, 
Si ~ S 2 , and for all u G Si and u G S 2 we have {u,v) G Ei for some fixed i. Notice that given an 
s-layered set S, there are at most s relations Liq,..., Ei^ such that ( 2 ) C Liq U • • • U Ei^. Hence, 
the lower bound in Theorem [Til is a direct consequence of the following result. 

Theorem 3.1. Let s > 1 and let V be an n-element point set in equipped with semi-algebraic 

relations Ei,..., Em such that each Ei has complexity at most t, Ei\J- ■ -VdEm = (^), and EiCiEj = id 

1 

for all i j. If rn < , then there is a subset S C P such that |S| = 2 ® and S is s-layered, 

where c = c{d, t). 


Proof. We proceed by induction on s. The base case s = 1 is trivial. For the inductive step, 
assume that the statement holds for s' < s. We will specify c = c{d, t) later. We start by applying 
Theorem 11.11 with parameter e = ^ to the point set P, which is equipped with semi-algebraic 
relations Ei,..., Em, and obtain an equitable partition P : P = Pi U • • • U Vk, where 


K<C2 



< C2m 


10s(P 


and C 2 = C 2 {d,t). Since all but an e fraction of the pairs of parts V are complete with respect to 
E^ for some I, by Turan’s theorem (see Theorem [IT] in Section 1), there are m®“^ -|- 1 parts Vf ^ P 


such that each pair (V^,Vj) G V x V is complete with respect to some relation Ei. Since V is an 
equitable partition, we have \ V-\ > -By picking c = c(d,t) sufficiently large, we have 


\Vl\E^ > 


n 


C2m 


10 ^ 2 . 


c(s-l)^ 


> m 


— 10c2d2 s 

=(s-i)^ > rn. 


By the induction hypothesis, each contains an (s — l)-layered set Si for z G {1,... + 1}. 

By the pigeonhole principle, there are two (s — l)-layered sets Si,Sj such that Si ~ Sj. Since 
Si X Sj C Ei for some I, the set 5 = S'* U Sj is an s-layered set. This completes the proof. 

□ 


To prove the upper bound for fd,t{'n,p, [logp]), when d > 1 and t > 100, it is sufficient to 
construct a 2™'-element point set y C M equipped with semi-algebraic relations Ei,... ,Em that 
is m-layered. More precisely, for each integer m > 1, we construct a set Vm of 2™ points in M 
equipped with semi-algebraic relations Ei,..., Em such that 


1 . Vm with respect to relations Ei,. ■ ■, Em is m-layered, 

2. U • • • U Em = (^) is a partition, 

3. each Ei has complexity at most four, and 

4. each Ei is shift invariant, that is (n, v) G Ei if and only if (tt -|- c, y -|- c) G Ei for c G M. 

We start by setting Vi = {1,2} and defining Ei = {u,v G Vi ■. \u — v\ = 1}. Having defined 
the point set Vi and relations Ei,..., Ei, we define Hj+i and Ei^i as follows. Let C = C{i) be a 
sufficiently large integer such that C > lOmax^gy. u. Then we have Hj+i = Vi U (Vi -|- C), where 
Vi -|- C is a translated copy of Vi. We now define the relation L^i+i by 


{u, v) G Ei^i C/2 < |rt — u| < 2C. 

Hence, Vi+i with respect to relations Ei,... ,Ei^i satisfies the properties stated above and is 
clearly (z -|- l)-layered. One can easily check that any set of p points in Vm induces at least [logp] 
distinct relations (colors). 


4 Ramsey numbers with respect to semi-algebraic colorings 

In this section, we prove Theorem 11.21 Consider all n-element point set V in equipped with 
symmetric semi-algebraic relations Ei,... ,Em C (^), such that each Ei has complexity at most 
t and (^) = U • • • U Em- Note that the binary relations Ei are not necessarily disjoint. We 
define the Ramsey function Rd,tiPi: ■ ■ ■ ^Pm) to be the minimum integer n such that for every n- 
element point set with these properties, there exists an z such that we can find pi points with 
every pair induced by them belonging to Ei. In particular, if pi = P 2 = • • • = Pm = P, then 
Rd,t{pi, ■ ■ ■ ,Pm) = Rd,t{p'-,'ni). The following result thus implies Theorem 11.21 

Theorem 4.1. For fixed integers d,t > 1, and Pi > 3 for 1 < i < m, we have 

RdAPl: • • • ,Pm) < 

where c = c{d, t). 
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Proof. Let c = c{d, t) be a sufficiently large constant that will be determined later. We proceed by 
induction on m and s = base case when m = 1 is trivial. Now assume m > 2, and 

the statement holds for m! < m. If s < 3m, then there is an i such that pi < 2. We can therefore 
reduce the problem to m — 1 colors, and the statement follows by induction. Hence we can assume 
s > 3m, and the statement holds for s' < s. 

Set n = ^ semi-algebraic relations on P such 

that = Si U • • • U Em, and each Sj has complexity at most t. Note that the relations Sj may 
not be disjoint. For the sake of contradiction, suppose that for every i G {1,... ,m}, V does not 
contain pi points such that every pair of distinct points induced by them is in Sj. 

For each relation Sj, there are t polynomials gip,... ,gi^t of degree at most t, and a Boolean 
function such that 


{u,v)eEi 4^ ^i{gi,i{u,v) >0,... ,gi^t{u,v) >0) = 1. 


For 1 < i < m, 1 < j < t, u G F, we define the surface a'ij{v) = {x G : gij{v,x) 
let 


S = {<Tij{v) : 1 < i < m,l < j < t,v £ V}. 


0 }, and 


Hence, |S| = mtn. We apply Lemma l2.ll to S with parameter r = 2tm, and decompose into 
ci(2tm)^'^ regions Aj, where ci = ci{t,d) is defined in Lemma 12.11 such that each region Aj is 
crossed by at most tmn/r = n/2 members in S. By the pigeonhole principle, there is a region 
A C M^, such that |A n F| > most n/2 members in S crosses A. Let Fi be a set 


of exactly 


Cl (2tm)2d 


points in F n A, and let V 2 be the set of points in F \ Fi that do not give 


rise to a surface that crosses A. Hence 


IF 2 I >n 


n 

ci(2tm)^'^ 



Therefore, each point u G V 2 has the property that v x Vi G Ei for some fixed i. We define the 
function x ^ F 2 —>• {1 ,..., m}, such that = * if and only if i is the minimum index for which 
u X Fi C Ei. Set I = {x(^) : V £ ^ 2 } and mo = |L|. That is, mg is the number of distinct relations 
(colors) between the sets Fi and F 2 . Now the proof falls into two cases. 


Case 1. Suppose mo > logm. If i G /, then Fi does not contain pi — 1 elements such that every 
pair induced by them belongs to Ei. By the induction hypothesis, we have 


2'^(SIiiPi) loglog(m-|-2) 

ci(2tm)2'^ 




Hence, 


cmologlog(m-|-2) < log(ci(2tm)^‘^) < 2filog(ci2tm), 

which implies 


^ 2cilog(ci2tm) 

“ c log log(m -|- 2) ’ 

and we have a contradiction for sufficiently large c = c{d,t). 
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Case 2. Suppose uiq < logm. By the pigeonhole principle, there is a subset V 3 C V 2 , such that 
IV 3 I > and Vi X V 3 C Ei for some fixed i. Hence, V 3 does not contain Pi — 1 points such that 
every pair induced by them is in E^. By the induction hypothesis, we have 


Therefore, 


Pi) log log(m+2) 

4mo 


< 11/31 < 


cloglog(m + 2 ) < log(4mo) < log(41ogm), 

which is a contradiction since c is sufficiently large. This completes the proof of Theorem 11.21 

□ 


5 A superlinear lower bound on distinct distances 

In this section, we prove Theorem 11.51 We need the following result in extremal graph theory. 

Theorem 5.1 (K6vari-S6s-Turan [25], Erdos). Let G = {U,V,E) be a bipartite graph such that 
\U\ = m and |I4| = n. If G does not contain the subgraph K 2 ^r with 2 vertices in U and r vertices 
in V, then 

|E'(G)| < 0{m\/n + n), 
where the hidden constant depends on r. 

In particular, we have that for any fixed r, all ii' 2 ,r'-free graphs on n vertices have a most 0(n^/^) 
edges. Our next result improves this upper bound under the additional condition that the edge set 
E of the graph is a semi-algebraic relation with constant description complexity. A similar theorem 
was proved in |16] for graphs G = {V,E) with V C where d <r). 

Theorem 5.2. For fixed d > 4, r > 2 and t > 1, let V C be an n-element point set equipped 
with a semi-algebraic relation E C (^) such that E has complexity at most t. If {V, E) is K 2 ^r-fi"ee, 

then \E\ < O ^ ^ where e is an arbitrarily small positive constant. 

Proof. Since E is semi-algebraic of description complexity at most t, there are polynomials gi,... ,gt 
and a Boolean formula $ such that for u,v £V, 


{u,v)£E 4^ ^{gi{u,v) >0,... ,gt{u,v) >0) = 1. 

For each point v G V, let ai^y = {x G : gi{x, v) = 0}, 1 < i < t. Set S = {ai^y : 1 < i < t,v G V}. 
Note that |S| = tn. 

For r = 77,i/(4<^-9+2e)^ apply Lemma l2.II to S, which partitions into at most 

2H—A4-F 2d-4+e 

Cir ^ = Cin4rf-9+2e 


cells Aj, where ci = ci{t) and e is arbitrarily small, such that each cell is crossed by at most |S|/r 
surfaces from S. By the pigeonhole principle, there is a cell A C that contains at least 


n 


2d-4+e 

Cin4d-9+2e 


-1 _ 2d—4+e 
(1/Cl)n 4d-9+2e 
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2d-4-\-e 


points from V. Let V' <ZV he a. set of exactly (l/ci)n^ 4d-9+2e points in Fn A. We can assume 

that n is sufficiently large so that \V'\ > 2 holds. Let U he the set of points in V that gives 
rise to at least one surface in S that crosses A. By the cutting lemma, 


tn 1 1 

\U\ < - = Cn 4d-9+2e . 

r 

Since {V,E) is iL 2 ,r-free, Theorem 15.11 implies that 

\E{U,V')\ < + \U\) < c3n^-434+27, 

where C 3 = C 3 {r,t). Hence there is a point v ^V' such that v has at most 


|W| 


< C4n2 


1 

8d-18+4e 


neighbors in U. Since G is iL 2 ,r-free, there are at most r — 1 points in V\{UiJV') that are neighbors 
of V. Hence, 


|A^g(' 1 ')| < \ v'\ + {r - 1) + O sd-is+fe^ <o{n^^ gd-i8+4e^ . 

We remove v and repeat this argument until no vertices remain in V. Then we have that 

\EiG)\ < O ^n^“8‘i-18+4E^ . 


□ 


We will use the d = 4 special case of Theorem 15.21 We also need the following simple lemma. 

Lemma 5.3. Let G = {V, E) he a graph with maximum degree p. Then G contains a matching of 
size \E\/{2p). 

Proof. We proceed by induction on \E\. The base case \E\ = 1 is trivial. Now assume the statement 
holds for all graphs with fewer than \E\ edges. We build our matching greedily. Let uv G E{G) be 
the first edge in our matching, and delete all other edges incident to u and v. Hence we have at 
least \E\ — 2p + 1 edges remaining, and the statement follows by the induction hypothesis. 

□ 


We are now ready to prove Theorem 11.51 
Proof of Theorem 11.51 Here we will show that 

^(n,p,Q - p + e) 

where 5 is an arbitrarily small constant. Let H be a planar n-element point that determines 
m distinct distances di,... ,dm, with the property that any p points from V determine at least 
q = (^) — p + 6 distinct distances, where p > 6. Let Ei C (^) such that uv G Ei if and only 
if \u — v\ = di, and let Gi = {V,Ei). Notice that Gi has maximum degree smaller than p, since 
otherwise we would have p points determining at most ( 2 ) —p+ f < q distinct distances. By Lemma 
lOl there is a matching E^ C Ei of size \Ei\/{2p) for every i. 
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Let U = V X V C and we define the relation Q C (^) such that {uiU 2 ,viV 2 ) G E if and 
only if ui,U 2 ,vi,V 2 are all distinct and |rii — fi| = \u 2 — V 2 \ or \ui — U2I = \u 2 — fil- Clearly, Q is a 
semi-algebraic relation with description complexity at most four, where t is an absolute constant. 
By the Cauchy-Schwarz inequality, we have 


m\Q\ > m 

i=l 


> 


mA 


2=1 


P 




Ei^<i 


V 2=1 


^ — 2 n^ 

- (2p)2 


which implies 


— 2n^ 

TY) ^ _ 

- (2p)2|Q|• 


(3) 


Hence, it suffices to bound \Q\ from above. By a standard probabilistic argument, we can partition 
U = UiL)U 2 such that at least half of the edges in Q are between Ui and U 2 and |f7i|, |f72| > Ll^l/2j • 
Therefore, it is enough to bound the number of edges between Ui and C/2. 

Fix a vertex ^1^2 G Ui, and let N{uiU 2 ) C U 2 such that N{uiU 2 ) = {viV 2 G U 2 : {uiU 2 ,viV 2 ) G 
Q}. Consider the graph G' = {V, N{U 1 U 2 ))■ We claim that G' has maximum degree less than p — 3. 
Indeed, suppose there is a vertex v with degree p — 3 in G' with neighbors tci,..., Wp- 3 . 

Observation 5.4. The points ui,U 2 ,v, wi,... Wp -3 determine at most ( 2 ) —p+3 distinct distances. 


Proof. We proceed by induction on p. The base case p = A follows since {uiU 2 ,vwi) G Q. Now 
assume that the statement holds up to p — 1. By the induction hypothesis, ui,U 2 ,v, wi,..., Wp-i 
determine at most 


(^2 ;-b-l)+3=Q-2(p-l) + 3 


distinct distances. Since (uitt2, utCp-s) G Q, we have either |tti — u| = |n2 — Wp-^l or |tti — Wp-^l = 
|n2 — u|. Thus, adding Wp -3 introduces at most p — 2 new distances. Therefore, ui,U 2 ,v,wi,... Wp -3 
determine at most 


-2(p-i) + 3 + (p-2)= -p + : 


distinct distances. 


□ 


Since G' = (H, N{uiU 2 )) has maximum degree at most p — 2, by Lemma [5l3l there is a subset 
N'{uiU 2 ) C N{uiU 2 ) of size forms a matching in G'. Now let Q' Q he the edges 

between Ui and U2 such that (uitt2,uiU2) G Q' if and only if (uiM2)^i^^2) G N'{uiU2). Notice we 
have 


\Q' 


\N’{uiU 2 )\ 

UlU2&Jl 


s E 

U1U2&U1 


\N{uiU2)\ 
2(p - 2) 


^ IQI 

- 4(p-2)’ 


and therefore it suffices to bound \Q' 


(4) 


Observation 5.5. If the bipartite graph {Ui,U 2 ,Q') contains the subgraph 2^2,r with two vertices 
in Ui and r vertices in U 2 , then there are 2r + 4 points in V that determine at most (^^ 2 ^) — ‘2r 
distinct distances. 
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Proof. Let uiU 2 ,U 3 Ui E Ui and viV 2 , ■ ■ ■ ,V 2 r-iV 2 r £ U 2 induce a 7^2,r in Q'- Then the points 
vi,V 2 , ■ ■ ■, V 2 r-i,V 2 r are all distinct in V. Now on the other side, either we have that ui,U 2 ,U 3 ,Ui 
are distinct, or we can assume without loss of generality that U 2 = U 3 , say. 

Case 1. Suppose ui, rt2, rta, tt4 are all distinct points. Since (uirt2, UjUj+i), (u3«4, UjUj+i) € Q', for 
every odd integer i E {l,3,5,...,2r— 1}, we get 2r elements of Q', and each such element gives 
a repeated distance. Hence, the number of repetitions is at least 2r, so ui, U2, ua, n4, ui,... U2r 
determine at most — 2r distinct distances. 

Case 2. Suppose U 2 = its. In view of Observation 15.41 (with p = 5), the five points ui,U 2 ,Ui 
and for i odd, determine at most (2) — 2 distinct distances. Hence, tti, U2, U4, ui,... U2r 

determine at most 


2’- +3) -2. 

distinct distances. Now adding any point w to ui, tt2, tf4, ui,... ^2^ gives us 2r + 4 points that 
determine at most 


distinct distances. 


("’' + 3)-2r+(2r + 3) 


2 r + 4^ 


— 2r 


□ 


Suppose that p is even and recall that p > 6 . By Observation 15.51 {Ui,U 2 ,Q') is IL2^(p_4)/2-free, 
since otherwise we would have p points in V that determine at most (2) — p + 4 distinct distances, 
which is a contradiction. By Theorem 15.21 with d = 4 and for a fixed p, we have 

\Q'\ < O < O . 

Together with (|3|) and ([3|), we obtain 


m > Q 



> n 


1 - 1 —±— 
77,-^^14+e 


If p is odd then {Ui, U 2 , Q') is I^2,(p-5)/2-free. Indeed, otherwise by Observation 15.51 we would have 
p — 1 points that determine at most 




2p + 6 


distinct distances. Adding any point to our collection gives us p points that determine at most 
(2) “ P + 5 distinct distances, a contradiction. Just as above, we have 

\Q'\ < O < O . 

Combining this with Q and ([3]), we obtain 


m> Pt 



> n 



= n 



□ 
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6 Ramsey-Turan problem for semi-algebraic graphs 


In this section, we prove Theorem [H The upper bound in Theorem 1 1. 71 follows from 

Theorem 6.1. Let e > 0 and let G = {V,E) be an n-vertex graph where V is a set of n points 
in and E is a semi-algebraie binary relation with complexity at most t. If G is K 2 p-free and 

\E\ > ^ ^1 — then G contains an independent set of size 'yn, where 7 = 'y{d,t,p,e). 

Proof. We apply Theorem 11.11 with m = 1 and parameter e/4 to obtain an equitable partition 
P : 14 = Vi U • • • U Vk such that | < K < (|)'^, where c = c{d, t), and all but an |-fraction of all 
pairs of parts in V are homogeneous (complete or empty with respect to E). If n < 10/4, then G 
has an independent set of size one, and the theorem holds trivially. So we may assume n > lOK. 
By deleting all edges inside each part, we have deleted at most 


K 


'\n/K~\\ 4n^ 

2 y - ^ 

edges. Deleting all edges between non-homogeneous pairs of parts, we lose an additional at most 


n 

K 


n 2 e iv:2 

-< e — 

4 2 “ 5 


edges. In total, we have deleted fewer than 2en^ jb edges of G. The only edges remaining in G are 
edges between homogeneous pairs of parts, and we have at least 5 +4/5^ edges. By 

Turan’s theorem (Theorem [TT|), there is at least one remaining copy of Kp, and these vertices lie in 
p distinct parts 14^,...,!^^ £V that form a complete p-partite subgraph. If any of these parts forms 
an independent set in G, say Vi., then we have an independent set of order \Vi.\ > [n/i4j > yn, 
where 7 = 7(d, t, e,p), and we are done. Otherwise, there is an edge in each of the p parts, and the 
endpoints of these p edges form a K 2 p in G, a contradiction. 

□ 


The lower bound on RT(n,/42p_i, o(n)) and RT(n, i42p, o(ra)) in Theorem 11.71 can be seen by 
the following construction. First, let us recall a result of Walczak. 

Lemma 6.2 ([3S])- For every n > 1, there is a collection S of n/{p — 1) segments in the plane 
whose intersection graph Gg is triangle-free and has no independent set of size cn/log log n where 
c is an absolute constant. 

The construction. In the plane, take p — 1 dilated copies of S, label them as S*!,... ,Sp-i, so 
that Si lies inside a ball with center (i,0) and radius 1/10. Set V = 5i U • • • U Sp-i. Note that 
\Fi\ = 'n/(p — 1) so that |14| = n. Let G = (14, E) be the graph whose vertices are the n segments in 
the plane, and two vertices are connected by an edge if and only if the corresponding segments cross 
or their left endpoints are at least 1/2 apart. The graph G consists of a complete (p — l)-partite 
graph, where each part induces a copy of the triangle-free graph Gg. Clearly, G is i42p_i-free and 
does not contain an independent set of size cn/log log n. Moreover, 

4(0)1 ^ K' - 

Since every segment can be represented by a point in and the intersection and distance rela¬ 
tions have bounded description complexity (see m), E is a symmetric semi-algebraic relation with 
complexity at most c, where c is an absolute constant. 
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